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(54) Forming a transparent window in a polishing pad for a chemical mechanical polishing 
apparatus 



(57) The disclosure relates to a polishing pad for a 
chemical mechanical polishing apparatus, and a meth- 
od for making the same. The polishing pad (18) has a 
covering layer (22) with a polishing surface (23) and a 
backing layer (20) which is adjacent to the platen (16). 
A first opening (630) in the covering layer with a first 



cross-sectional area and a second opening (632) in the 
backing layer with a second, different cross-sectional ar- 
ea form an aperture through the polishing pad. A sub- 
stantially transparent pofyurethane plug (600) is posi- 
tioned in the aperture, and an adhesive material fixes 
the plug in the aperture. 
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Description 

This invention relates generally to semiconductor 
manufacture, and more particularly to a method for 
forming a transparent window in a polishing pad for use s 
in chemical mechanical polishing (CMP). 

In the process of fabricating modem semiconductor 
integrated circuits (ICs), it is necessary to form various 
material layers and structures over previously formed 
layers and structures. However, the prior formations of- io 
ten leave the top surface topography of an in-process 
wafer highly irregular, with bumps, areas of unequal el- 
evation, troughs, trenches, and/or other surface irregu- 
larities. These irregularities cause problems when form- 
ing the next layer. For example, when printing a photo- is 
lithographic pattern having small geometries over pre- 
viously formed layers, a very shallow depth of focus is 
required. Accordingly, it becomes essential to have a flat 
and planar surface, otherwise, some parts of the pattern 
will be in focus and other parts will not. In fact, surface 20 
variations on the order of less than 1000 A over a 25 x 
25 mm die would be preferable. In addition, if the irreg- 
ularities are not leveled at each major processing step, 
the surface topography of the wafer can become even 
more irregular, causing further problems as the layers 25 
stack up during further processing. Depending on the 
die type and the size of the geometries involved, the sur- 
face irregularities can lead to poor yield and device per- 
formance. Consequently, it is desirable to effect some 
type of plaharization, or leveling, of the IC structures. In 30 
fact, most high density IC fabrication techniques make 
use of some method to form a planarized wafer surface 
at critical points in the manufacturing process. 

One method for achieving semiconductor wafer 
planarization or topography removal is the chemical me- 35 
chanical polishing (CMP) process. In general, the chem- 
ical mechanical polishing (CMP) process involves hold- 
ing and/or rotating the wafer against a rotating polishing 
platen under a controlled pressure. As shown in Fig. 1 , 
a typical CMP apparatus 10 includes a polishing head 40 
12 for holding the semiconductor wafer 14 against the 
polishing platen 16. The polishing platen 16 is covered 
with a pad 18. This pad 18 typically has a backing layer 
20 which interfaces with the surface of the platen and a 
covering layer 22 which is used in conjunction with a *s 
chemical polishing slurry to polish the wafer 1 4. Howev- 
er, some pads have only a covering layer and no backing 
layer. The covering layer 22 is usually either an open 
cell foamed polyurethane (e.g. Rodel IC 1000) or a sheet 
of polyurethane with a grooved surface (e.g. Rodel so 
EX2000). The pad material is wetted with the chemical 
polishing slurry containing both an abrasive and chem- 
icals. One typical chemical slurry includes KOH (Potas- 
sium Hydroxide) and fumed-sitica particles. The platen 
is usually rotated about its central axis 24. In addition, ss 
the polishing head is usually rotated about its central 
axis 26, and translated across the surface of the platen 
16 via a translation arm 28. Although just one polishing 



head is shown in Fig. 1, CMP devices typically have 
more than one of these heads spaced circumferentially 
around the polishing platen. 

A particular problem encountered during a CMP 
process is in the determination that a part has been 
planarized to a desired flatness or relative thickness. In 
general, there is a need to detect when the desired sur- 
face characteristics or planar condition has been 
reached. This has been accomplished in a variety of 
ways. Early on, it was not possible to monitor the char- 
acteristics of the wafer during the CMP process. Typi- 
cally, the wafer was removed from the CMP apparatus 
and examined elsewhere. If the wafer did not meet the 
desired specifications, it had to be reloaded into the 
CMP apparatus and reprocessed. This was a time con- 
suming and labor-intensive procedure. Alternately, the 
examination might have revealed that an excess 
amount of material had been removed, rendering the 
part unusable. There was, therefore, a need in the art 
for a device which could detect when the desired surface 
characteristics or thickness had been achieved, in-situ, 
during the CMP process. 

Several devices and methods have been developed 
for the in-situ detection of endpoints during the CMP 
process. For instance, devices and methods that are as- 
sociated with the use of ultrasonic sound waves, and 
with the detection of changes in mechanical resistance, 
electrical impedance, or wafer surface temperature, 
have been employed. These devices and methods rely 
on determining the thickness of the wafer or a layer 
thereof, and establishing a process endpoint, by moni- 
toring the change in thickness. In the case where the 
surface layer of the wafer is being thinned, the change 
in thickness is used to determine when the surface layer 
has the desired depth. And, in the case of planarizing a 
patterned wafer with an irregular surface, the endpoint 
is determined by monitoring the change in thickness and 
knowing the approximate depth of the surface irregular- 
ities. When the change in thickness equals the depth of 
the irregularities, the CMP process is terminated. Al- 
though these devices and methods work reasonably 
well for the applications for which they were intended, 
there is still a need for systems which provide a more 
accurate determination of the endpoint. 

In general, in one aspect, the invention is directed 
to a polishing pad for a chemical mechanical polishing 
apparatus. The polishing pad comprises a polishing sur- 
face with an aperture formed therein. The aperture in- 
cludes a first section with a first dimension and a second 
section with a second dimension. A substantially trans- 
parent plug is positioned in the aperture. The plug has 
a first portion positioned in the first section of the aper- 
ture and a second portion positioned in the second sec- 
tion of the aperture. There is a means for securing the 
plug in the aperture. 

In general, in another aspect, the invention is direct- 
ed to a method of forming a polishing pad. An aperture 
is formed in the polishing pad such that the aperture in- 
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eludes a first section with a first dimension and a second 
section with a-second dimension. A substantially trans- 
parent plug is placed in the aperture. A first portion of 
the plug is positioned in the first section of the aperture 
and a second portion of the plug is positioned in the sec- 5 
ond section of the aperture. The plug is secured in the 
aperture. 

Implementations include the following. The secur- 
ing means may include an adhesive material. The first 
portion of the plug may have substantially the same di- 10 
mension as the first section of the aperture, and the sec- 
ond section of the plug may have substantially the same 
dimension as the second section of the aperture. The 
first dimension may be larger than the second dimen- 
sion. The plug may be a polyurethane material, and the *5 
adhesive may be an elastomeric polyurethane material. 
The first section of the aperture may be formed in a first 
layer, and the second section of the aperture may be 
formed in a second layer. The removing step may in- 
clude removing the first section from the first layer of the 20 
polishing pad and removing the second section from a 
second layer of the polishing pad. The durometer meas- 
urement of the first layer may be greater than the du- 
rometer measurement of the second layer The top sur- 
face of the plug may be coplanar with the polishing sur- 2s 
face, whereas the thickness of the second portion of the 
plug may be less than the depth of the second section 
of the aperture. 

Additional objects and advantages of the invention 
will be set forth in the description which follows, and in 30 
part will be obvious from the description, or may be 
learned by practice of the invention. The objects and ad- 
vantages of the invention may be realized by means of 
the instrumentalities and combinations particularly 
pointed out in the claims. 35 

The accompanying drawings, which are incorporat- 
ed and constitute a part of the specification, schemati- 
cally illustrate an embodiment of the invention, and to- 
gether with the general description given above and the 
detailed description given below, serve to explain the *o 
principles of the invention. 

FIG. 1 is a side view of a chemical mechanical pol- 
ishing (CMP) apparatus typical of the prior art. 

FIG. 2 is a side view of a chemical mechanical pol- 
ishing apparatus with endpoint detection constructed in 45 
accordance with the present invention. 

FIGS. 3A-D are simplified cross-sectional views of 
respective embodiments of the window portion of the 
apparatus of Fig. 2. 

FIG. 3E is a simplified top view of the transparent 50 
plug used in the window portion of FIG. 3D. 

FIG. 3F is a simplified cross-sectional view illustrat- 
ing the assembly of the window portion of FIG. 3D. 

FIG. 4 is a simplified cross-sectional view of a win- 
dow portion of the apparatus of Fig. 2, showing compo- 55 
nents of a laser interferometer generating a laser beam 
and detecting a reflected interference beam. 

FIG. 5 is a simplified cross-sectional view of a blank 
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oxide wafer being processed by the apparatus of Fig. 2, 
schematically showing the laser beam impinging on the 
wafer and reflection beams forming a resultant interfer- 
ence beam. 

FIG. 6 is a simplified top view of the platen of the 
apparatus of Fig. 2, showing one possible relative ar- 
rangement between the window and sensor flag, and 
the sensor and laser interferometer. 

FIG. 7 is a top view of the platen of the apparatus 
of Fig. 2, showing a relative arrangement between the 
window and sensor flag, and the sensor and laser, 
where the window is in the shape of an arc. 

FIG. 8 is a flow chart of a method of piece-wise data 
acquisition in accordance with the present invention. 

FIGS. 9A-B are graphs showing the cyclic variation 
in the data signal from the laser interferometer over time 
during the thinning of a blank oxide wafer The graph of 
Fig. 9A shows the integrated values of the data signal 
integrated over a desired sample time, and the graph of 
Fig. 9B shows a filtered version of the integrated values. 

FIG. 1 0A is a block diagram of a backward-looking 
method of determining the endpoint of a CMP process 
to thin the oxide layer of a blank oxide wafer in accord- 
ance with the present invention. 

FIG. 1 0B is a block diagram of a forward-looking 
method of determining the endpoint of a CMP process 
to thin the oxide layer of a blank oxide wafer in accord- 
ance with the present invention. 

FIGS. 1 1 A-C are simplified cross-sectional views of 
a patterned wafer with an irregular surface being proc- 
essed by the apparatus of Fig. 2, wherein Fig. 11 A 
shows the wafer at the beginning of the CMP process, 
Fig. 11 B shows the wafer about midway through the 
process, and Fig. 1 1 C shows the wafer close to the point 
of planarization. 

FIG. 12 is a flow chart diagram of a method of de- 
termining the endpoint of a CMP process to planarize a 
patterned wafer with an irregular surface in accordance 
with the present invention. 

FIG. 1 3 is a graph showing variation in the data sig- 
nal from the laser interferometer over time during the 
planarization of a patterned wafer. 

FIG. 1 4 is a block diagram of a method of determin- 
ing the endpoint of a CMP process to control the film 
thickness overlying a particularly sized structure, or 
group of similarly sized structures, in accordance with 
the present invention. 

FIG. 15A is a simplified cross-sectional view of a 
wafer with a surface imperfection being illuminated by 
a narrow-diameter laser beam. 

FIG. 15B is a simplified cross-sectional view of a 
wafer with a surface imperfection being illuminated by 
a wide-diameter laser beam. 

FIG. 16 is a graph showing the cyclic variation in 
the data signal from the laser interferometer over time 
during the thinning of a blank oxide wafer and including 
the high frequency signal associated with a nonuniform 
wafer surface. 
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FIG. 17 is a schematic representation of a CMP sys- 
tem including an interferometer and a computer pro- 
grammed to analyze and respond to the output signal 
of interferometer waveform. 

FIG. 1 8 is a block diagram showing the functionality s 
that is implemented within the computer to perform in- 
situ monitoring of uniformity. 

FIGS. 1 9(a)-(c) show examples of an interferometer 
signal, the interferometer signal after it has been filtered 
by a low frequency bandpass pass filter, and the inter- 
ferometer signal after it has been filtered by a high fre- 
quency bandpass pass filter, respectively. 

FIG. 20(a) -(b) are flow charts showing the proce- 
dure for generating and then using a signature of a CMP 
system to qualify it for production use. 

FIG. 21 (a) is simplified cross-sectional view of an 
embodiment of the window portion of the apparatus of 
Fig. 2 employing the polishing pad as the window, and 
showing a reflection from the backside of the pad. 

FIG. 21 (b) is a graph showing the cyclical variation 
in the data signal from the laser interferometer over time 
with a large DC component caused by the reflection 
from the backside of the pad of the embodiment of Fig. 
21(a). 

FIG. 21(c) is simplified cross-sectional view of an 
embodiment of the window portion of the apparatus of 
Fig. 2 employing the polishing pad as the window with 
a diffused backside surface to suppress reflections. 

FIG. 21 (d) is a graph showing the cyclical variation 
in the data signal from the laser interferometer over time 
without the large DC component caused by reflection 
from the backside of the pad as a result of the diffuse 
backside surface of the embodiment of Fig. 21(c). 

Fig. 2 depicts a portion of a CMP apparatus modi- 
fied in accordance with one embodiment of the present 
invention. A hole 30 is formed in the platen 16 and the 
overlying platen pad 18. This hole 30 is positioned such 
that it has a view of the wafer 1 4 held by a polishing 
head 1 2 during a portion of the platen' s rotation, regard- 
less of the translational position of the head 12. A laser 
interferometer 32 is fixed below the platen 16 in a posi- 
tion enabling a laser beam 34 projected by the laser 
interferometer 32 to pass through the hole 30 in the plat- 
en 16 and strike the surface of the overlying wafer 14 
during a time when the hole 30 is adjacent the wafer 14. 

A detailed view of the platen hole 30 and wafer 14 
(at a time when it overlies the platen hole 30) are shown 
in Figs. 3A-C. As can be seen in Fig. 3A, the platen hole 
30 has a stepped diameter, thus forming a shoulder 36. 
The shoulder 36 is used to contain and hold a quartz 
insert 38 which functions as a window for the laser beam 
34. The interface between the platen 16 and the insert 
38 is sealed, so that the portion of the chemical slurry 
40 finding its way between the wafer 14 and insert 38 
cannot leak through to the bottom of the platen 16. The 
quartz insert 38 protrudes above the top surface of the 
platen 16 and partially into the platen pad 18. This pro- 
trusion of the insert 38 is intended to minimize the gap 
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between the top surface of the insert 38 and the surface 
of the wafer 14. By minimizing this gap, the amount of 
slurry 40 trapped in the gap is minimized. This is advan- 
tageous because the slurry 40 tends to scatter light 
traveling through it, thus attenuating the laser beam 
emitted from the laser interferometer 32. The thinner the 
layer of slurry 40 between the insert 38 and the wafer 
14, the less the laser beam 34 and light reflected from 
the wafer, is attenuated. It is believed a gap of approx- 
imately 1 mm would result in acceptable attenuation val- 
ues during the CMP process. However, it is preferable 
to make this gap even smaller. The gap should be made 
as small as possible while still ensuring the insert 38 
does not touch the wafer 1 4 at any time during the CMP 
process. In a tested embodiment of the present inven- 
tion, the gap between the insert 38 and wafer 14 was 
set at 10 mils (250 \um) with satisfactory results. 

Fig. 3B shows an alternate embodiment of the plat- 
en 16 and pad 18. In this embodiment, the quartz insert 
has been eliminated and no through-hole exists in the 
pad 18. Instead, the backing layer 20 (if present) of the 
pad 18 has been removed in the area overlying the hole 
30 in the platen 16. This leaves only the polyurethane 
covering layer 22 of the pad 18 between the wafer 14 
and the bottom of the platen 1 6. It has been found that 
the polyurethane material used in the covering layer 22 
will substantially transmit the laser beam 34 from the la- 
ser interferometer 32. Thus, the portion of the covering 
layer 22 which overlies the platen hole 30 functions as 
a window for the laser beam 34. This alternate arrange- 
ment has significant advantages. First, because the pad 
18 itself is used as the window, there is no appreciable 
gap. Therefore, very little of the slurry 40 is present to 
cause the detrimental scattering of the laser beam. An- 
other advantage of this alternate embodiment is that pad 
wear becomes irrelevant. In the first-described embod- 
iment of Fig 3A, the gap between the quartz insert 38 
and the wafer 14 was made as small as possible. How- 
ever, as the pad 18 wears, this gap tends to become 
even smaller. Eventually, the wear could become so 
great that the top surface of the insert 38 would touch 
the wafer 14 and damage it. Since the pad 18 is used 
as the window in the alternate embodiment of Fig 3B, 
and is designed to be in contact with the wafer 14, there 
are no detrimental effects due to the wearing of the pad 
18. It is noted that tests using both the opaque open-cell 
and transparent grooved surface types of pads have 
shown that the laser beam is less attenuated with the 
transparent grooved surface pad. Accordingly, it is pref- 
erable that this type of pad be employed. 

Although the polyurethane material used in the cov- 
ering layer of the pad is substantially transmissive to the 
laser beam, it does contain certain additives, such as 
nylon microspheres, which inhibit its transmissiveness. 
This problem is eliminated in the embodiment of the in- 
vention depicted in Fig. 3C. In this embodiment, the typ- 
ical pad material in the region overlying the platen hole 
30 has been replaced with a solid polyurethane plug 42. 
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This plug 42, which functions as the window lor the laser 
beam, is made of a polyurethane material which lacks 
the nylon microspheres. Accordingly, the attenuation of 
the laser beam 34 through the plug 42 is minimized. The 
plug 42 may be integrally molded into the pad 18. s 

For example, the plug may be formed by pouring 
liquid polyurethane into a hole that has been cut in the 
polishing pad. The liquid polyurethane is cured to form 
a plug which is integrally molded into the polishing pad. 
Alternately, the plug 42 could be preformed as a solid 
insert. This insert could be placed in the bulk molten pol- 
ishing pad material, and then the entire assembly could 
be cured so that the material of the plug 42 and the ma- 
terial of the polishing pad 18 bond together. When the 
assembly is cooled, the polyurethane plug 42 would be 
integrally molded into the polishing pad. However, the 
material of the polishing pad 18, and specifically the cov- 
ering layer 22, is different from the material of the poly- 
urethane plug 42. Therefore when the assembly is 
cured, the material of the plug 42 tends to contract and 
buckle the window up or down. This causes either a cup 
which can accumulate slurry or a bump which can dam- 
age the wafer 14. 

Referring to FIGS. 3D, in another embodiment, a 
two-level plug 600 is positioned in the polishing pad 18 
above the platen hole 30. The two-level plug 600 is 
formed of a relatively transparent material which acts as 
a window for the laser beam. The material of the two- 
level plug 600 may be a substantially pure polyurethane 
available from Rodel of Newark, New Jersey, under the 
product name EX-2000. Such a material is chemically 
inert vis-a-vis the polishing process, and erodes at the 
same rate as the polishing pad. The two-level plug 600 
includes an upper plug portion 602 and a lower plug por- 
tion 604. The upper plug portion 602 fits into a hole or 
opening 630 in the covering layer 22 and the lower plug 
portion 604 fits into a hole or opening 632 in the backing 
layer 20. The top surface 606 of the upper plug portion 
602 is co-planar with the top surface 23 of the polishing 
pad 1 8. There may be a gap 61 0 between the lower sur- 
face 608 of the lower plug portion 604 and the top sur- 
face 17 of the platen 16. 

The application of a load from the wafer 1 4 on the 
polishing pad 18 will cause the backing layer 20 to com- 
press. Thus, the width of the gap 61 0 will decrease. The 
gap 61 0 is selected to be sufficiently wide that the lower 
surface 608 will not contact the upper surface 17 of the 
platen 16, even if the wafer 14 is positioned over the 
platen hole 30. The top surface 606 contacts the wafer 
14 but, due to the gap 610, does not exert pressure on 
it. Therefore, the denser material of the two-level plug 
600 does not create a locally increased load. Thus, the 
two-level plug 600 does not adversely affect the polish- 
ing of the wafer 14. 

Referring to FIGS. 3E and 3F, the polishing pad 18 
may be assembled as follows. The two-level plug 600 
is machined or molded from a solid piece of poly- 
urethane. An aperture 61 2 is cut into a polishing pad 18. 



Alternately, the polishing pad 18 may be integrally mold- 
ed with the aperture 612. The aperture 61 2 includes two 
sections. The first section of the aperture may be th 
hole 630 in covering layer 22 and the second section of 
the aperture may be the hole 632 in the backing layer 
20. The aperture 612 matches the shape of two-level 
plug 600. The plug may be in the form of adjacent rec- 
tangular slabs having different cross-sectional areas. 
Specifically, the cross-sectional area of the lower plug 
portion 604 may be larger than the cross-sectional area 
of the upper plug portion 602. The upper plug portion 
602 may have a length L, of about 2.0 inches and a 
height of about 0.5 inches The lower plug portion 604 
may have a length L 2 of about 2.2 inches and a height 
H 2 of about 0.7 inches. Thus, the lower plug portion 604 
extends beyond the upper plug portion 602 to form a rim 
616 having a width of about 0.1 inches. The. plug 
may be oriented so that its longitudinal axis lies along a 
radius of the polishing pad. Although FIGS. 3D-F show 
the upper plug portion 602 as having a smaller cross- 
sectional area than the lower plug portion 604, this is 
not necessary. Instead, the upper plug portion 602 may 
be smaller than the lower plug portion 604. The upper 
plug portion 602 has a thickness T, equal to the thick- 
ness of covering layer 22, i.e., about fifty mils. Thus, the 
thickness T 1 is equal to the depth D-, of the first section 
of the aperture. The jower plug portion 604 is thinner 
than the backing layer 20 by about ten mils. The lower 
plug portion 604 may have a thickness T 2 of about forty 
mils. Thus, the thickness T 2 is less than the depth D 2 of 
the second section of the aperture. 

An adhesive material 614 is placed on the rim 616 
of the lower plug portion 604. The adhesive material 61 4 
may be an elastomeric polyurethane available from Ber- 
man Industries of Van Nuys, California under the trade 
name WC-575 A/B. Other adhesive materials, such as 
rubber cement or an epoxy, may also be used for the 
adhesive material 614. 

An area 618 on the underside of covering layer 22 
is cleaned by scraping off any adhesive debris and 
washing the area with acetone. Then the two-level plug 
600 is inserted into the aperture 612 until the rim 616 of 
plug 600 contacts the area 618 of the polishing pad 18. 
This contact area is placed under a load of approximate- 
ly fifteen to twenty pounds per square inch. This forces 
the adhesive material 614 into the gaps between upper 
plug portion 602 and covering layer 22 or between lower 
plug portion 604 and backing layer 20. After a few days 
at room temperature, the adhesive material 614 will 
have cured and the plug 600 will be fixed in the aperture 
612. The adhesive material 614 could be cured more 
quickly by the application of heat, but an excessive tem- 
perature may deform the backing material 20. 

There may be grooves or pores 620 cut into the cov- 
ering layer 22 of the polishing pad 18 to provide for im- 
proved slurry distribution. These grooves or pores 620, 
which are located above the lower plug portion 604, are 
filled with a pure polyurethane material 622. In addition, 
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the top surface 606 of the two-level plug 600 is left un- 
grooved. Because there are no grooves or depressions 
in the area of the two-level plug 600, there is no accu- 
mulation of slurry which could block the laser beam 34. 
During the conditioning process, in which a pad condi- s 
tioner grinds away the top surface 23 of the covering 
layer 22 to restore the roughness of the polishing pad 
18, the top surface 606 of two-level plug 600 will be 
scratched and abraded. Because polyurethane is a dif- 
fusive material, the abrasion of the top surface 606 will 
not significantly affect the performance of the laser inter- 
ferometer 32. 

The window provided by the two-level plug 600 pre- 
vents the accumulation of slurry above the platen hole 
30 which could block the laser beam 34. The plug 600 
is formed of a material which is chemically resistant to 
the slurry 40 and is chemically inert vis-a-vis the polish- 
ing process. The plug erodes at the same rate as the 
rest of the polishing pad 18. The plug is sealed within 
the aperture to prevent the leakage of the slurry 40 into 
the platen hole 30, and the plug may be depressed to 
prevent the wafer from experiencing a locally increased 
load. 

In operation, a CMP apparatus in accordance with 
the present invention uses the laser beam from the laser 
interferometer to determine the amount of material re- 
moved from the surface of the wafer, or to determine 
when the surface has become ptanarized. The begin- 
ning of this process will be explained in reference to Fig 
4. It is noted that a laser and collimator 44, beam splitter 
46, and detector 48 are depicted as elements of the la- 
ser interferometer 32. This is done to facilitate the afore- 
mentioned explanation of the operation of the CMP ap- 
paratus. In addition, the embodiment of Fig. 3A employ- 
ing the quartz insert 38 as a window is shown for con- 
venience. Of course, the depicted configuration is just 
one possible arrangement, others can be employed. For 
instance, any of the aforementioned window arrange- 
ments could be employed, and alternate embodiments 
of the laser interferometer 32 are possible. One alter- 
nate interferometer arrangement would use a laser to 
produce a beam which is incident on the surface of the 
wafer at an angle. In this embodiment, a detector would 
be positioned at a point where light reflecting from the 
wafer would impinge upon it. No beam splitter would be 
required in this alternate embodiment. 

As illustrated in Fig. 4, the laser and collimator 44 
generate a collimated laser beam 34 which is incident 
on the lower portion of the beam splitter 46. A portion of 
the beam 34 propagates through the beam splitter 46 
and the quartz insert 38. Once this portion of beam 34 
leaves the upper end of the insert 38, it propagates 
through the slurry 40, and impinges on the surface of 
the wafer 14. The wafer 14, as shown in detail in Fig. 5 
has a substrate 50 made of silicon and an overlying ox- 
ide layer 52 (i.e. Si0 2 ). 

The portion of the beam 34 which impinges on the 
wafer 14 will be partially reflected at the surface of the 



oxide layer 52 to form a first reflected beam 54. Howev- 
er, a portion of the light will also be transmitted through 
the oxide layer 52 to form a transmitted beam 56 which 
impinges on the underlying substrate 50. At least some 
of the light from the transmitted beam 56 reaching the 
substrate 50 will be reflected back through the oxide lay- 
er 52 to form a second reflected beam 58. The first and 
second reflected beams 54, 58 interfere with each other 
constructively or destructively depending on their phase 
relationship, to form a resultant beam 60, where the 
phase relationship is primarily a function of the thickness 
of the oxide layer 52. 

Although, the above-described embodiment em- 
ploys a silicon substrate with a single oxide layer, those 
skilled in the art will recognize the interference process 
would also occur with other substrates and other oxide 
layers. The key is that the oxide layer partially reflects 
and partially transmits, and the substrate at least par- 
tially reflects, the impinging beam. In addition, the inter- 
ference process may also be applicable to wafers with 
multiple layers overlying the substrate. Again, if each 
layer is partially reflective and partially transmissive, a 
resultant interference beam will be created, although it 
will be a combination of the reflected beams from all the 
layer and the substrate. 

Referring again to Fig. 4, it can be seen the resultant 
beam 60 representing the combination of the first and 
second reflected beams 54, 58 (Fig. 5) propagates back 
through the slurry 40 and the insert 38, to the upper por- 
tion of the beam splitter 46. The beam splitter 46 diverts 
a portion of the resultant beam 60 towards the detector 
48. 

The platen 16 will typically be rotating during the 
CMP process. Therefore, the platen hole 30 will only 
have a view of the wafer 14 during part of its rotation. 
Accordingly, the detection signal from the laser interfer- 
ometer 32 should only be sampled when the wafer 14 
is impinged by the laser beam 34. It is important that the 
detection signal not be sampled when the laser beam 
34 is partially transmitted through the hole 30, as when 
a portion is blocked by the bottom of the platen 1 6 at the 
hole' s edge, because this will cause considerable noise 
in the signal. To prevent this from happening, a position 
sensor apparatus has been incorporated. Any well 
known proximity sensor could be used, such as Hall ef- 
fect, eddy current, optical interrupter, or acoustic sensor, 
although an optical interrupter type sensor was used in 
the tested embodiments of the invention and will be 
shown in the figures that follow. An apparatus accord- 
ingly to the present invention for synchronizing the laser 
interferometer 32 is shown in Fig. 6, with an optical in- 
terrupter type sensor 62 (e.g. LED/photodiode pair) 
mounted on a fixed point on the chassis of the CMP de- 
vice such that it has a view of the peripheral edge of the 
platen 16. This type of sensor 62 is activated when an 
optical beam it generates is interrupted. A position sen- 
sor flag 64 is attached to the periphery of the platen 16. 
The point of attachment and length of the flag 64 is made 



15 



20 



25 



30 



35 



40 



45 



50 



11 



EP0 824 995 A1 



12 



such that it interrupts the sensor' s optical signal only 
when the laser beam 34 from the laser interferometer 
32 is completely transmitted through the previously-de- 
scribed window structure 66. For example, as shown in 
Fig. 6, the sensor 62 could be mounted diametrically op- s 
posite the laser interferometer 32 in relation to the center 
of the platen 16. The flag 64 would be attached to the 
platen 1 6 in a position diametrically opposite the window 
structure 66. The length of the flag 64 would be approx- 
imately defined by the dotted lines 68, although, the ex- io 
act length of the flag 64 would be fine tuned to ensure 
the laser beam is completely unblocked by the platen 
16 during the entire time the flag 64 is sensed by the 
sensor 62. This fine tuning would compensate for any 
position sensor noise or inaccuracy, the responsiveness is 
of the laser interferometer 32, etc. Once the sensor 62 
has been activated, a signal is generated which is used 
to determine when the detector signal from the interfer- 
ometer 32 is to be sampled. 

Data acquisition systems capable of using the po- 20 
sition sensor signal to sample the laser interferometer 
signal during those times when the wafer is visible to the 
laser beam, are well known in the art and do not form a 
novel part of the present invention. Accordingly, a de- 
tailed description will not be given herein. However 2s 
some considerations should be taken into account in 
choosing an appropriate system. For example, it is pre- 
ferred that the signal from the interferometer be integrat- 
ed over a period of time. This integration improves the 
signal-to-noise ratio by averaging the high frequency 30 
noise over the integration period. This noise has various 
causes, such as vibration from the rotation of the platen 
and wafer, and variations in the surface of the wafer due 
to unequal planarization. In the apparatus described 
above the diameter of the quartz window, and the speed 35 
of rotation of the platen, will determine how long a period 
of time is available during any one rotation of the platen 
to integrate the signal. However, under some circum- 
stances, this available time may not be adequate. For 
instance, an acceptable signal-to-noise ratio might re- 40 
quire a longer integration time, or the interface circuitry 
employed in a chosen data acquisition system may re- 
quire a minimum integration time which exceeds that 
which is available in one pass. 

One solution to this problem is to extend the platen 45 
hole along the direction of rotation of the platen. In other 
words, the window structure 66' (i.e. insert, pad, or plug) 
would take on the shape of an arc, as shown in Fig. 7. 
Of course, the flag 64' is expanded to accommodate the 
longer window structure 66'. Alternately, the window so 
could remain the same, but the laser interferometer 
would be mounted to the rotating platen directly below 
the window. In this case, the CMP apparatus would have 
to be modified to accommodate the interferometer be- 
low the platen, and provisions would have to be made ss 
to route the detector signal from the interferometer. 
However, the net result of either method would be to 
lengthen the data acquisition time for each revolution of 



the platen. 

Although lengthening the platen hole and window 
is advantageous, it does somewhat reduce the surface 
area of the platen pad. Therefore, the rate of planariza- 
tion is decreased in the areas of the disk which overlie 
the window during a portion of the platen' s rotation. In 
addition, the length of the platen hole and window must 
not extend beyond the edges of the wafer, and the data 
sampling must not be done when the window is beyond 
the edge of the wafer, regardless of the wafer 4 s trans- 
lational position. Therefore, the length of the expanded 
platen hole and window, or the time which the platen- 
mounted interferometer can be sampled, is limited by 
any translational movement of the polishing head. 

Accordingly, a more preferred method of obtaining 
adequate data acquisition integration time is to collect 
the data over more than one revolution of the platen. In 
reference to Fig. 8, during step 102, the laser interfer- 
ometer signal is sampled during the available data ac- 
quisition time in each rotation of the platen. Next, in 
steps 104 and 106, each sampled signal is integrated 
over the aforementioned data acquisition time, and the 
integrated values are stored. Then, in steps 108 and 
110, a cumulative sample time is computed after each 
complete revolution of the platen and compared to a de- 
sired minimum sample time. Of course, this would con- 
stitute only one sample time if only one sample has been 
taken. If the cumulative sample time equals or exceeds 
the desired minimum sample time, then the stored inte- 
grated values are transferred and summed, as shown 
in step 112. If not, the process of sampling, integrating, 
storing, computing the cumulative sample time, and 
comparing it to the desired minimum sample time con- 
tinues. In a final step 1 1 4, the summed integrated values 
created each time the stored integrated values are 
transferred and summed, are output as a data signal. 
The just-described data collection method can be imple- 
mented in a number of well known ways, employing ei- 
ther logic circuits or software algorithms. As these meth- 
ods are well known, any detailed description would be 
redundant and so has been omitted. It is noted that the 
method of piece-wise data collection provides a solution 
to the problem of meeting a desired minimum sample 
time no matter what the diameter of the window or the 
speed of platen rotation. In fact, if the process is tied to 
the position sensor apparatus, the platen rotation speed 
could be varied and reliable data would still be obtained. 
Only the number of platen revolutions required to obtain 
the necessary data would change. 

The aforementioned first and second reflected 
beams which formed the resultant beam 60, as shown 
in Figs. 4 and 5, cause interference to be seen at the 
detector 48. If the first and second beams are in phase 
with each other, they cause a maxima on detector 48. 
Whereas, if the beams are 180 degrees out of phase, 
they cause a minima on the detector 48. Any other 
phase relationship between the reflected beams will re- 
sult in an interference signal between the maxima and 
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minima being seen by the detector 48. The result is a 
signal output from the detector 48 that cyclically varies 
with the thickness ot the oxide layer 52, as it is reduced 
during the CMP process. In fact, it has been observed 
that the signal output from the detector 48 will vary in a 
sinusoidal-like manner, as shown in the graphs of Figs. 
9A-B. The graph of Fig. 9A shows the integrated ampli- 
tude of the detector signal (y-axis) over each sample pe- 
riod versus time (x-axis). This data was obtained by 
monitoring the laser interferometer output of the appa- 
ratus of Fig. 4, while performing the CMP procedure on 
a wafer having a smooth oxide layer overlying a silicon 
substrate (i.e. a blank oxide wafer). The graph of Fig. 
9B represents a filtered version of the data from the 
graph of Fig. 9 A. This filtered version shows the cyclical 
variation in the interferometer output signal quite clearly. 
It should be noted that the period of the interference sig- 
nal is controlled by the rate at which material is removed 
from the oxide layer during the CMP process. Thus, fac- 
tors such as the downward force placed on the wafer 
against the platen pad, and the relative velocity between 
the platen and the wafer determine the period. During 
each period of the output signal plotted in Figs. 9A-B, a 
certain thickness of the oxide layer is removed. The 
thickness removed is proportional to the wavelength of 
the laser beam and the index of refraction of the oxide 
layer. Specifically, the amount of thickness removed per 
period is approximately A/2n, where X is the freespace 
wavelength of the laser beam and n is the index of re- 
fraction of the oxide layer. Thus, it is possible to deter- 
mine how much of the oxide layer is removed, in-situ, 
during the CMP process using the method illustrated in 
Fig. 10A. First, in step 202, the number of cycles exhib- 
ited by the data signal are counted. Next, in step 204, 
the thickness of the material removed during one cycle 
of the output signal is computed from the wavelength of 
the laser beam and the index of refraction of the oxide 
layer of the wafer. Then, the desired thickness of mate- 
rial to be removed from the oxide layer is compared to 
the actual thickness removed, in step 206. The actual 
thickness removed equals the product of the number of 
cycles exhibited by the data signal and the thickness of 
material removed during one cycle; In the final step 208, 
the CMP process is terminated whenever the removed 
thickness equals or exceeds the desired thickness of 
material to be removed. 

Alternately, less than an entire cycle might be used 
to determine the amount of material removed. In this 
way any excess material removed over the desired 
amount can be minimized. As shown in the bracketed 
portions of the step 202 in Fig. 10A, the number of oc- 
currences of a prescribed portion of a cycle are counted 
in each iteration. For example, each occurrence of a 
maxima (i.e. peak) and minima (i.e. valley), or vice ver- 
sa, would constitute the prescribed portion of the cycle. 
This particular portion of the cycle is convenient as maxi- 
ma and minima are readily detectable via well know sig- 
nal processing methods. Next, in step 204, after deter- 



mining how much material is removed during a cycle, 
this thickness is multiplied by the fraction of a cycle that 
the aforementioned prescribed portion represents. For 
example in the case of counting the occurrence of a 

5 maxima and minima, which represents one-half of a cy- 
cle, the computed one-cycle thickness would be multi- 
plied by one-half to obtain the thickness of the oxide lay- 
er removed during the prescribed portion of the cycle. 
The remaining steps in the method remain unchanged. 

10 The net result of this alternate approach is that the CMP 
process can be terminated after the occurrence of a por- 
tion of the cycle. Accordingly, any excess material re- 
moved will, in most cases, be less than it would have 
been if a full cycle where used as the basis for deter- 

is mining the amount of material removed. 

The just-described methods look back from the end 
of a cycle, or portion thereof, to determine if the desired 
amount of material has been removed. However, as in- 
ferred above, the amount of material removed might ex- 

20 ceed the desired amount. In some applications, this ex- 
cess removal of material might be unacceptable. In 
these cases, an alternate method can be employed 
which looks forward and anticipates how much material 
will be removed over an upcoming period of time and 

25 terminates the procedure when the desired thickness is 
anticipated to have been removed. A preferred embod- 
iment of this alternate method is illustrated in Fig. 10B. 
As can be seen, the first step 302 involves measuring 
the time between the first occurrence of a maxima and 

30 minima, or vice versa, in the detector signal (although 
an entire cycle or any portion thereof could have been 
employed). Next, in step 304, the amount of material re- 
moved during that portion of the cycle is determined via 
trie previously described methods. A removal rate is 

35 then calculated by dividing the amount of material re- 
moved by the measured time, as shown in step 306. 
This constitutes the rate at which material was removed 
in the preceding portion of the cycle. In the next step 
308, the thickness of the material removed as calculated 

40 in step 304 is subtracted from the desired thickness to 
be removed to determine a remaining removal thick- 
ness. Then, in step 310, this remaining removal thick- 
ness is divided by the removal rate to determine how 
much longer the CMP process is to be continued before 

45 its termination. 

It must be noted, however, that the period of the de- 
tector signal, and so the removal rate, will typically vary 
as the CMP process progresses. Therefore, the above- 
described method is repeated to compensate for this. In 

so other words, once a remaining time has been calculat- 
ed, the process is repeated for each occurrence of a 
maxima and minima, orvice versa. Accordingly, the time 
between the next occurring maxima and minima is 
measured, the thickness of material removed during the 

55 portion of the cycle represented by this occurrence of 
the maxima and minima (i.e. one-half) is divided by the 
measured time, and the removal rate is calculated, just 
as in the first iteration of the method. However, in the 
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next step 308, as shown in brackets, the total amount 
of material removed during all the previous iterations is 
determined before being subtracted trom the desired 
thickness. The rest of the method remains the same in 
that the remaining thickness to be removed is divided s 
by the newly calculated removal rate to determine the 
remaining CMP process time. In this way the remaining 
process time is recalculated after each occurrence of 
the prescribed portion of a cycle of the detector signal. 
This process continues until the remaining CMP proc- 10 
ess time wilt expire before the next iteration can begin. 
At that point the CMP process is terminated, as seen in 
step 312. Typically, the thickness to be removed will not 
be accomplished in the first one-half cycle of the detec- 
tor signal, and any variation in the removal rate after be- *s 
ing calculated for the preceding one-half cycle will be 
small. Accordingly, it is believe this forward-looking 
method will provide a very accurate way of removing just 
the desired thickness from the wafer. 

While the just-described monitoring procedure 20 
works well for the smooth -surfaced blank oxide wafers 
being thinned, it has been found that the procedure can- 
not be successfully used to planarize most patterned 
wafers where the surface topography is highly irregular. 
The reason for this is that a typical patterned wafer con- 25 
tains dies which exhibit a wide variety of differently sized 
surface features. These differently sized surface fea- 
tures tend to polish at different rates. For example, a 
smaller surface feature located relatively far from other 
features tends to be reduced faster than other larger f ea- 30 
tures. Fig. 11 A-C exemplify a set of surface features 72, 
74, 76 of the oxide layer 52 associated with underlying 
structures 78, 80, 82, that might be found on a typical 
patterned wafer 14, and the changes they undergo dur- 
ing the CMP process. Feature 72 is a relatively small 35 
feature, feature 74 is a medium sized feature, and fea- 
ture 76 is a relatively large feature: Fig. 11 A shows the 
features 72, 74, 76 before polishing, Fig 11 B shows the 
features 72, 74, 76 about midway through the polishing 
process, and Fig. 11 C shows the features 72, 74, 76 to- 40 
wards the end of the polishing process. In Fig. 11 A, the . 
smaller feature 72 will be reduced at a faster rate than 
either the medium or large features 74, 76. In addition, 
the medium feature 74 will be reduced at a faster rate 
than the large feature 76. The rate at which the features 45 
72, 74, 76 are reduced also decreases as the polishing 
process progresses. For example, the smaller feature 
72 will initially have a high rate of reduction, but this rate 
will drop off during the polishing process. Accordingly, 
Fig. 11 B shows the height of the features 72, 74, 76 50 
starting to even out, and Fig. 11 C shows the height of 
the features 72, 74, 76 essentially even. Since the dif- 
ferently sized features are reduced at different rates and 
these rates are changing, the interference signal pro- 
duced from each feature will have a different phase and 55 
frequency. Accordingly, the resultant interference sig- 
nal, which is partially made up of all the individual re- 
flections from each of the features 72, 74, 76, will fluc- 
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tuate in a seemingly random fashion, rather than the 
previously described periodic sinusoidal signal. 

However, as alluded to above, the polishing rates 
of the features 72, 74, 76 tend to converge closer to the 
point of planarization. Therefore, the difference in phase 
and frequency between the interference beams pro- 
duced by the features 72, 74, 76 tend to approach zero. 
This results in the resultant interference signal becom- 
ing recognizable as a periodic sinusoidal wave form. 
Therefore, it is possible to determine when the surface 
of a patterned wafer has become planarized by detect-, 
ing when a sinusoidal interference signal begins. This 
method is illustrated in Fig. 12. First, in step 402, a 
search is made for the aforementioned sinusoidal vari- 
ation in the interferometer signal. When the sinusoidal 
variation is discovered, the CMP procedure is terminat- 
ed, as shown in step 404. 

Ftg. 13 is a graph plotting the amplitude of the de- 
tector signal over time for a patterned wafer undergoing 
a CMP procedure. The sampled data used to construct 
this graph was held at its previous integrated value until 
the next value was reported, thus explaining the 
squared-off peak values shown. A close inspection 
shows that a discernible sinusoidal cycle begins to 
emerge at approximately 250 seconds. This coincides 
with the point where the patterned wafer first became 
planarized. Of course, in real-time monitoring of the 
interferometer* s output signal, it would be impossible to 
know exactly when the cycling begins. Rather, at least 
some portion of the cycle must have occurred before it 
can be certain that the cycling has begun. Preferably, 
no more than one cycle is allowed to pass before the 
CMP procedure is terminated. A one-cycle limit is a 
practical choice because it provides a high confidence 
that the cycling has actually begun, rather than the sig- 
nal simply representing variations in the noise caused 
by the polishing of the differently sized features on the 
surface of the wafer In addition, the one-cycle limit en- 
sures only a small amount of material is removed from 
the surface of the wafer after it becomes planarized. It 
has been found that the degree of planarization is es- 
sentially the same after two cycles, as it was after one. 
Thus, allowing the CMP procedure to continue would 
only serve to remove more material from the surface of 
the wafer. Even though one cycle is preferred in the case 
where the CMP process is to be terminated once the 
patterned wafer becomes planarized, it is not intended 
that the present invention be limited to that time frame. 
If the signal is particularly strong, it might be possible to 
obtain the same level of confidence after only a portion 
of a cycle. Alternately, if the signal is particularly weak, 
it may take more than one cycle to obtain the necessary 
confidence. The choice will depend on the characteris- 
tics of the system used. For instance, the size of the gap 
between the quartz window and the surface of the wafer 
will have an effect on signal strength, and so the deci- 
sion on how many cycles to wait before terminating the 
CMP process. 
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The actual determination as to when the output sig- 
nal from the laser interferometer is actually cycling, and 
so indicating that the surface of the wafer has been 
planarized can be done in a variety of ways. For exam- 
ple, the signal could be digitally processed and an algo- s 
rithm employed to make the determination. Such a 
method is disclosed in U.S. Patent 5,097,430, where the 
slope of the signal is used to make the determination. 
In addition, various well known curve fitting algorithms 
are available. These methods would essentially be used 
to compare the interferometer signal to a sinusoidal 
curve. When a match occurs within some predeter- 
mined tolerance, it is determined that the cycling has 
begun. Some semiconductor applications require that 
the thickness of the material overlying a structure 
formed on a die of a patterned wafer (i.e. the film thick- 
ness) be at a certain depth, and that this film thickness 
be repeatable from die to die, and from wafer to wafer. 
The previously described methods for planarizing a typ- 
ical patterned wafer will not necessarily produce this de- 
sired repeatable film thickness. The purpose of the 
plana rizat ion methods is to create a smooth and flat sur- 
face, not to produce a particular film thickness. Accord- 
ingly, if it is desirable to control the film thickness over 
a specific structure, or group of similarly sized struc- 
tures, an alternate method must be employed. This al- 
ternate method is described below. 

As alluded to previously, each differently sized sur- 
face feature resulting from a layer of oxide being formed 
over a patterned structure on a die tends to produce a 
reflected interference signal with a unique frequency 
and phase. It is only close to the point of planarization 
that the frequency and phase of each differently sized 
feature converges. Prior to this convergence the unique 
frequency and phase of the interference signals caused 
by the various differently sized features combine to pro- 
duce a detector signal that seems to vary randomly. 
However, it is possible to process this signal to eliminate 
the interference signal contributions of all the features 
being polished at different rates, except a particularly 
sized feature, or group of similarly sized features. Once 
the interference signal associated with the particularly 
sized feature, or group of features, has been isolated, 
the methods discussed in association with the removal 
of material from a blank oxide disk are employed to re- 
move just the amount of material necessary to obtain 
the desired film thickness. 

Of course, the frequency of the interference signal 
component caused by the feature of interest must be 
determined prior to the signal processing. It is believed 
this frequency can be easily determined by performing 
a CMP process on a test specimen which includes dies 
exclusively patterned with structures corresponding to 
the structure which is to have a particular overlying film 
thickness. The detector signal produced during this 
CMP process is analyzed via well known methods to de- 
termine the unique frequency of the interference signal 
caused by the surface features associated with the 



aforementioned structures. 

The specific steps necessary to perform the above- 
described method of controlling the film thickness over 
a specific structure, or group of similarly sized structures 
on a die, in situ, during the CMP processing of a wafer, 
will now be described in reference to Fig. 14. In step 
502, the detector signal is filtered to pass only the com- 
ponent of the signal having the predetermined frequen- 
cy associated with the structure of interest. This step is 
accomplished using well known band pass filtering tech- 
niques. Next, in step 504 a measurement is made of the 
time between the first occurrence of a maxima and mini- 
ma, or vice versa, in the detector signal (although an 
entire cycle or any portion thereof could have been em- 
ployed). The amount of material removed during that 
portion of the cycle (i.e. one- half cycle) is determined in 
step 506 via previously described methods. Then, a re- 
moval rate is then calculated by dividing the amount of 
material removed by the measured time, as shown in 
step 508. This constitutes the rate at which material was 
removed in the preceding portion of the cycle. In the next 
step 510, the thickness of the material removed as cal- 
culated in step 506 is subtracted from the desired thick- 
ness to be removed (i.e. the thickness which when re- 
moved will result in the desired film thickness overlying 
the structure of interest), to determine a remaining re- 
moval thickness. Then, this remaining removal thick- 
ness is divided by the aforementioned removal rate to 
determine how much longer the CMP process is to be 
continued before it termination, in step 512. Once a re- 
maining time has been calculated, the process is repeat- 
ed for each occurrence of a maxima and minima, or vice 
versa. Accordingly, the time between the next occurring 
maxima and minima is measured, the thickness of ma- 
terial removed during the portion of the cycle represent- 
ed by this occurrence of the maxima and minima (i.e. 
one-half) is divided by the measured time, and the re- 
moval rate is calculated, just as in the first iteration of 
the method. However, in the next step 510, as shown in 
brackets, the total amount of material removed during 
all the previous iterations is determined before being 
subtracted from the desired thickness. The rest of the 
method remains the same in that the remaining thick- 
ness to be removed is divided by the newly calculated 
removal rate to determine the remaining CMP process 
time. This process is repeated until the remaining time 
expires before the next iteration can begin. At that point, 
the CMP process is terminated, as seen in step 514. 

It is noted that although the method for controlling 
film thickness described above utilizes the method for 
determining the CMP process endpoint illustrated in Fig. 
10B, any of the other endpoint determination methods 
described herein could also be employed, if desired. 

It is further noted that the beam diameter (i.e. spot) 
and wavelength of the laser beam generated by the la- 
ser interferometer can be advantageously manipulated. 
As shown in Figs. 1 5A and 1 5B, a narrow beam 84, such 
as one focused to the smallest spot possible for the 
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wavelength employed, covers a smaller area of the sur- 
face of the wafer 14 than a wider, less focused beam 
86. This narrow beam 84 is more susceptible to scatter- 
ing (i.e. beam 88) due to surface irregularities 90, than 
the wider beam 86, since the wider beam 86 spreads 5 
out over more of the surface area of the wafer 14, and 
encompasses more of the surface irregularities 90. 
Therefore, a wider beam 86 would have an integrating 
effect and would be less susceptible to extreme varia- 
tions in the reflected interference signal, as it travels 10 
across the surface of the wafer 14. Accordingly, a wider 
beam 86 is preferred for this reason. The laser beam 
width can be widened using well known optical devices. 

It must also be pointed out that the wider beam will 
reduce the available data acquisition time per platen is 
revolution since the time in which the beam is complete- 
ly contained within the boundaries of the window is less 
than it would be with a narrower beam. However, with 
the previously described methods of data acquisition, 
this should not present a significant problem. In addition, 20 
since the wider beam also spreads the light energy out 
over a larger area than a narrower beam, the intensity 
of the reflections will be lessen somewhat. This draw- 
back can be remedied by increasing the power of the 
laser beam from the laser interferometer so that the loss 25 
in intensity of the reflected beams is not a factor in de- 
tection. 

As for the wavelength of the laser beam, it is feasi- 
ble to employ a wavelength anywhere from the far infra- 
red to ultraviolet. However, it is preferred that a beam in 30 
the red light range be used. The reason for this prefer- 
ence is two-fold. First, shorter wavelengths result in an 
increase in the amount of scattering caused by the 
chemicai slurry because this scattering is proportional 
to the 4th power of the frequency of the laser beam, ss 
Therefore, the longer the wavelength, the less the scat- 
tering. However, longer wavelengths also result in more 
of the oxide layer being removed per period of the inter- 
ference signal, because the amount of material re- 
moved per period equals approximately X/2n. There- 40 
fore, the shorter the wavelength, the less material re- 
moved in one period. It is desirable to remove as little 
of the material as possible during each period so that 
the possibility of any excess material being removed is 
minimized. For example, in a system employing the pre- *s 
viously described method by which the number of cy- 
cles, or a portion thereof, are counted to determine the 
thickness of the oxide layer removed, any excess ma- 
terial removed over the desired amount would be mini- 
mized if the amount of material removed during each so 
cycle, or portion thereof, is as small as possible. 

It is believed these two competing factors in the 
choice of wavelength are optimally balance if a red light 
laser beam is chosen. Red light offers an acceptable de- 
gree of scattering while not resulting in an unmanagea- s $ 
ble amount of material being removed per cycle. 



Further Embodiments 

The generated interference waveform provides 
considerable additional information about the polishing 
process. This additional information can be used to pro- 
vide an in-situ measurement of the uniformity of the pol- 
ished layer. It can also be used to detect when the CMP 
system is not operating within spec (i.e., not operating 
as desired). Both of these uses will now be described. 

Uniformity Measurement : 

The polishing and/or planar izat ion operations which 
are performed on the CMP system are generally re- 
quired to produce a surface layer that is uniform across 
the surface of the wafer/substrate. In other words, the 
center of the wafer should polish at the same rate as the 
edge of the wafer. Typically, the thickness of the pol- 
ished layer must not vary by more than about 5-10%. If 
that level of uniformity is not achieved, it is likely that the 
wafer will not be usable since the device yields will be 
unacceptably low. In practice, it is often quite difficult to 
achieve a uniform polishing rate across the wafer. It typ- 
ically requires optimizing many different variables to 
keep it performing within the specs. The end point de- 
tector described above provides a very useful tool for 
monitoring the uniformity of the layer being polished and 
that monitoring can be performed both in-situ data ac- 
quisition and processing. 

We have discovered that the interference waveform 
that is produced by the interferometer during polishing 
provides information about the uniformity of the layer 
that is being polished. As noted above, the output of the 
interferometer appear as a sinusoidal signal as the sur- 
face layer (e.g. oxide layer) is being polished. The dis- 
tance between the peaks of that signal indicate how 
much material has been removed. On top of that sinu- 
soidal signal there will also be another higher frequency 
sinusoidal signal. The amplitude of the higher frequency 
signal indicates by how much the thickness of the pol- 
ished layer varies across the surface of the wafer. 

The reason that the high frequency signal appears 
is as follows. As the polishing is being performed, the 
interferometer typically samples (or looks at) different 
locations across the surface of the wafer. This is be- 
cause during polishing, both the platen and the wafer 
are rotating and in addition the wafer is also being 
moved axially relative to the platen. Thus, during polish- 
ing different areas of the wafer' s surface pass over the 
hole in the platen through which the interferometer sees 
the layer that is being polished. If the polished layer is 
completely uniform, the resulting interference waveform 
will be unaffected by the sampling of the different loca- 
tions across the wafer' s surface. That is, it will have sub- 
stantially the same amplitude. On the other hand, if the 
polished layer is not uniform, the sampling of different 
locations introduce, a further variation onto the sinusoi- 
dal base signal. This further variation has a frequency 
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that is dependent on the rotation . and sweep rates that 
are used and it has an amplitude that is proportional to 
the degree of nonuniformity of the polished layer. An ex- 
ample of such a waveform is shown in Fig. 16. In this 
particular example, the nonuniformity was relatively s 
large so as to clearly illustrate the high frequency signal. 

A measure of the uniformity is the ratio of the peak r 
to-peak amplitude A hf of the high frequency signal to the 
peak-to-peak amplitude A )f of the low frequency signal. 
The smaller this ratio, the more uniform the polished lay- 10 
er will be; and conversely, the larger this ratio, the more 
nonuniform it will be. 

A CMP system which produces a measure of uni- 
formity is shown in Fig. 1 7. In addition to the components 
shown in the previously described Fig. 2, it also includes is 
a computer 1 50, which is programmed to control the op- 
eration of the interferometer and to perform the signal 
analysis that is required to produce a measure of uni- 
formity from the interference signal, and it includes a dis- 
play unit 160 through which various information and re- 20 
suits are displayed to an operator. Computer 150 can 
be any device which is capable of performing the control 
and signal processing functions including, for example, 
a standard PC which is programmed appropriately and 
a dedicated, specially designed digital processing unit. 25 
Display unit 160 can be a video display, a printer, or any 
appropriate device or combination of devices for com- 
municating information to the operator of the CMP sys- 
tem. 

To generate a uniformity measure, computer 1 50 is 30 
programmed to implement and perform the signal 
processing and other functions shown in Fig. 18. In that 
regard, computer 150 implements two programmable 
bandpass filters, namely, a high frequency filter 152 and 
a low frequency filter 1 54. High frequency filter 1 52 has 35 
a passband centered on the frequency of the high fre- 
quency signal containing the uniformity information and 
low frequency filter 1 54 has a passband centered on the 
frequency of the low frequency signal containing the pol- 
ishing rate information. The width of both of these pass- *o 
bands is on the order of a few milliherz in the case when 
the period is on the order of tens of seconds. Indeed, 
the width of the passband is programmed to vary in pro- 
portion with the center frequency, or stated differently, 
to vary inversely to the period of the signal being exam- 4 & 
ined. That is, if the period of the relevant signal increas- 
es, the bandwidth of the passband filter decreases and 
vice versa. 

Fig. 1 9(a) shows an example of an interferometer 
signal obtain from an actual system. Note that initially so 
the signal indicates that the layer is quite uniform, i.e., 
no discernible high frequency signal is riding on top of 
the low frequency signal. After polishing has been per- 
formed for a short period of time, a high frequency signal 
begins to appear, indicating a certain level of nonuni- 55 
formity. Low frequency filter 1 54 selects the low frequen- 
cy component and filters out the other frequencies to 
produce an output signal of the form shown in Fig. 19 



(b). Similarly, high frequency filter 152 selects the high 
frequency component and filters out the other frequen- 
cies to produce an output signal of the form shown in 
Fig. 19(c). 

Computer 150 implements two amplitude measure- 
ment functions 156 and 158 which measure the peak- 
to-peak amplitudes of the output signals of fitters 152 
and 154, respectively. Once the amplitudes of the two 
filtered signals has been determined, computer 150 
computes a ratio of the p-p amplitude of the high fre- 
quency signal to the p-p amplitude of the low frequency 
signal (i.e., A h /A, f ) (see functional block 162). After the 
ratio has been computed, computer 1 50 compares (see 
block 166) the computed ratio to a threshold or refer- 
ence value 1 64 that was previously stored in local mem- 
ory. If the computed ratio exceeds the stored threshold 
value, computer 150 alerts the operator that nonuni- 
formity of the polished layer exceeds an acceptable 
amount. In response, the operator can adjust the proc- 
ess parameters to bring the process back into spec. 

Since the high frequency signal tends to appear on- 
ly after some polishing has been performed, it is useful 
to wait before attempting to measure nonuniformity. In- 
deed, it may be desirable to automatically compute the 
ratio periodically so as to monitor the uniformity of the 
polished layer throughout the polishing operation, in that 
case, it may also be desirable for computer 1 50 to output 
the computed ratios throughout the process so that the 
operator can detect changes and/or trends which are 
appearing in the polishing process. This would be par- 
ticularly useful if the in-situ monitoring was done during 
on actual production wafers during polishing. 

Note that the functions just described can be imple- 
mented through software that is running on the compu- 
ter or they can be implemented through dedicated cir- 
cuits built for this specific purpose. 

The bandpass filters can be implemented using 
techniques which are well known to persons skilled in 
the art. In the described embodiment, they are FIR (finite 
impulse response) filters which can be implemented in 
either the frequency or the time domain. However, to 
perform the filtering in real time as the interferometer 
signal becomes available, the filtering is done in the time 
domain by convolving the appropriate function with the 
waveform as it is being generated. The appropriate 
function is, of course, simply the time domain represen- 
tation of a bandpass filter having the desired character- 
istics (i.e., center frequency and bandwidth). 

To specify the appropriate filter parameters it is nec- 
essary to know the frequency of the signal that is to be 
selected by the filter. This information can be obtained 
easily from the interferometer signal waveform(s). For 
example, the center frequency for the low frequency fil- 
ter can be obtained by running a batch (e.g. 25) of wa- 
fers (e.g. blank wafers with only an oxide coating) to ob- 
tain an accurate measure of the polishing rate. Alterna- 
tively, the polishing rate can be determined at the start 
of a polishing run by measuring the distance between 
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peaks of the low frequency signal. Of course, using this 
alternative approach produces results that are not as ac- 
curate as averaging measurements over a larger 
number of wafers. In any case, the polishing rate deter- 
mines the center frequency of the bandpass filter and £ 
by knowing the center frequency .along with the desired 
bandwidth of the filter one can readily determine the pre- 
cise form of the time domain filter function and/or the 
coefficients of the FIR filter 

The frequency of the high frequency signal can be 10 
obtained in a similar manner, i.e., directly from the trace 
that is generated by the interferometer as the CMP sys- 
tem is polishing the wafer. In other words, the operator 
simply measures the distance between peaks of the 
high frequency signal. This process can be readily au- is 
tomated so that the operator, with the aid of a pointing 
device (e.g. a mouse), can mark two points on the wave- 
form appearing on a video display and the computer can 
be programmed to automatically compute the frequency 
and then generate the appropriate filter coefficients. The 20 
filter coefficients and/or time domain representation of 
the filter functions are then stored in focal memory for 
use later during the polishing runs to perform the filtering 
operations. 

25 

Process Signature : 

The interferometer waveform also represents a sig- 
nature of (i.e., it characterizes) the system for which it 
was obtained. Because of this, it provides information 30 
which is useful for qualifying a system for production op- 
eration. If a signature is obtained for a system that is 
known to be operating as desired, that signature wave- 
form (or features extracted from the waveform) can be 
used as a reference against which subsequently gener- 35 
ated signatures can be compared to determine whether 
the system or systems from which signatures were sub- 
sequently obtained are performing within spec. For ex- 
ample, if the polishing pads are changed or a new batch 
of slurry is used in the CMP system, the operator needs *o 
to know whether that change has detrimentally affected 
the quality of the polishing which the system performs. 
We have discovered that a change in performance of 
the CMP system results in a change in the signature. 
That is, certain features will appear in the waveform that 
were not previously present or previously existing fea- 
tures will change. By detecting those changes, it is pos- 
sible to detect when a system is not performing as de- 
sired. 

In the described embodiment, the extracted fea- so 
tures from the interferometer waveform are the polishing 
rate and the measure of uniformity. Both of these char- 
acteristics are readily obtainable from the interferometer 
waveform that is generated during polishing by using the 
methods previously described. A properly operating ss 
system will produce a particular polishing rate and a par- 
ticular measure of uniformity. A drift away from these 
reference values provides an indication that the system 



is moving away from its desired operating point and 
alerts the operator to the need for corrective action so 
as to avoid destroying product. 

A method which uses a CMP system signature is 
illustrated in Fig. 20a and will now be described. Initially, 
an interferometer waveform (i.e., a signature) is gener- 
ated for a CMP system which is known to be operating 
optimally (step 250). The decision as to whether the sys- 
tem is operating optimally can be determined empirically 
by processing a set of test wafers and analyzing the re- 
sults. When the results that are produced are within 
spec, then the signature can be generated for that con- 
figuration and set of operating conditions. Before cap- 
turing a portion of the interferometer waveform, it is de- 
sirable to polish the wafer between 50-100% of the way 
through the oxide so that the waveform is truly a signa- 
ture of the polishing set up. 

After the waveform has been obtained, certain rel- 
evant features are then extracted from the generated 
waveform (step 252) and stored for later use as a refer- 
ence against which to evaluate that system' s perform- 
ance at some later time or times (step 254). Alternative- 
ly, the waveform itself can be stored and used as the 
reference. In the described embodiment, the extracted 
features are the polishing rate and the measure of uni- 
formity, both of which can be determined from the wave- 
form as described above. 

Referring to Fig. 20b, at some later time the stored 
signature (or extracted features) can be used to qualify 
that system or another system for production use. To 
qualify a system for production, a new signature is ob- 
tained for that system (step 258) and the relevant fea- 
tures are extracted from that new signature (step 260). 
The extracted features are then compared to the stored 
reference set of features (step 264). If the operating 
point, as characterized by the set of extracted features, 
falls within a predetermined region around the reference 
point, as defined by the stored reference set of features, 
then it is concluded that the system is operating properly 
and that it can be brought online for processing product 
wafers (step 266). If this process is automated, the com- 
puter may at this point alert the operator that the process 
is within spec. On the other hand, if the operating point 
falls outside of the predetermined region, that is an in- 
dication that the system is not operating within spec and 
the operator is alerted to this problem so that corrective 
action can be taken (step 268). The corrective action 
might involve adjusting some process parameter appro- 
priately to bring the process within spec. For example, 
if the polishing rate is excessive or if oxide nonuniformity 
is larger than permitted, then the operator may recog- 
nize that it is appropriate to try a new batch of slurry, or 
to adjust the pressure on the pad, or to even replace the 
pad. The particular course of corrective action that is 
chosen will of course depend upon the details of how 
the system has departed from its desired operating 
point, the configuration and operating parameters of the 
particular system, and what the operator* s experience 
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has taught him. 

To provide further usetul information to the operator, 
the computer also optionally outputs through its display 
device(s) information about the extracted features (step 
262). The displayed information may be presented as 5 
the extracted features, the waveform, how close the var- 
ious extracted features are to the different features of 
the stored reference set, or in whatever manner proves 
to be most useful for the operator. 

Of course, the above-described in-situ, real time io 
monitoring procedure can be used periodically while 
processing production wafers or whenever some proc- 
ess parameter is changed in the CMP system (e.g. a 
new polishing pad is used, pad pressure is adjusted, or 
a new batch of slurry is used) and it becomes necessary is 
to know that the CMP process is still within spec. In ad- 
dition, it can be used on blank wafers, instead of actual 
product, to qualify the CMP system prior to using it on 
actual product. 

Though we have described a straight forward and 20 
simple approach to extracting information from the sig- 
nature waveform, i.e., by using the polishing rate and 
the measure of uniformity, the signature or interferom- 
eter waveform can be analyzed by using more sophis- 
ticated techniques (e.g. pattern or feature recognition or 2$ 
other image analysis algorithms, or neural networks, 
just to name a few alternatives). The information which 
various extracted features convey regarding the opera- 
tion of the system can be determined through experi- 
ence and the ones which convey the information that is 30 
perceived to be of most importance to the operator can 
be used. 

Also, it should be noted that simply displaying the 
interferometer waveform (i.e., the process signature) to 
the operator can be yield valuable feedback on how well 35 
the system is behaving. Typically, the human eye is ex- 
tremely sensitive in detecting even subtle changes in an 
image from what one expects to see. Thus, after gaining 
some experience, the operator will often be able to de- 
tect changes and imminent problems in the overall CMP *o 
system performance simply by looking at the waveform. 
Thus, in the described embodiment, the computer also 
displays the signature waveform to the operator during 
processing so that the operator can also use it to monitor 
equipment performance. 45 

Using techniques known to persons skilled in the 
art, one can readily develop software algorithms which 
automatically recognize or detect the changes for which 
the operator is looking and which tip off the operator to 
certain problems. so 

A Modification for Obtaining Improved Performance 

Another embodiment involves a modification to the 
window in the pad between the interferometer and the ss 
wafer. Although the pad will transmit a substantial por- 
tion of the interferometer laser beam, it has been found 
that there is also a significant reflective component from 



the bottom surface of the pad. This situation is illustrated 
in Fig. 21 (a) where part of the laser beam 34 emanating 
from the laser interferometer 32 is transmitted through 
the pad 22 to form a transmitted beam 702, and part of 
the laser beam 34 is reflected from the backside surface 
704 of the pad 22 to form a reflected beam 706. The 
reflected beam 706 creates a considerable direct cur- 
rent (DC) shift in the data signal. Fig. 21(b) illustrates 
this shift (although exaggerated for purposes of clarity). 
Inn this example, the DC shift resulting from the reflect- 
ed laser light adds about 8.0 volts to the overall signal. 
The DC shift creates problems in analyzing the useful 
portion of the data signal. For example, if the data anal- 
ysis equipment operates in a range of 0-10 volts, ampli- 
fication of the DC shifted signal to enhance the portion 
of interest is all but impossible without reducing or elim- 
inating the DC component of the signal. If the DC com- 
ponent is not eliminated, the equipment would be satu- 
rated by the amplified signal. Reducing or eliminating 
the DC component electronically requires added signal 
processing electronics and may result in a degradation 
of the useful portion of the signal. Even if the DC shift is 
not as large as described here, some signal processing 
will still likely be required to eliminate it. Accordingly, a 
non-electronic method of reducing or eliminating this un- 
wanted DC component is desirable. 

It has been found that by creating a diffuse surface 
704' on the backside of the pad 22 in the area constitut- 
ing the window, as depicted in Fig. 21(c), the reflected 
light from that surface is attenuated. Thus, the unwanted 
DC component of the data signal is reduced. The diffuse 
surface 704' in effect scatters the non-transmitted laser 
light 708 rather than reflecting most of it back towards 
the interferometer 32. The reflected signal from the wa- 
fer must also pass through the diffuse surface 704* and 
in doing so some of it will also be scattered. However, it 
has been found that this does not seriously degrade the 
performance of the interferometer. 

Fig. 21 (d) illustrates the data signal obtained when 
the diffuse surface 704' is employed. As can be seen, 
with the elimination of the DC component, the signal can 
be readily amplified and processed without the need to 
electronically eliminate any DC portion. 

How the diffuse surface is produced is not of central 
importance. It can be produced by sanding the back sur- 
face of the polishing pad in the vicinity of the window or 
by applying a material coating which is diffuse (e.g. 
Scotch tape), or in any other way that produces the de- 
sired results. 

The present invention has been described in terms 
of a preferred embodiment. The invention, however, is 
not limited to the embodiment depicted and described. 
Rather, the scope of the invention is defined by the ap- 
pended claims. 
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Claims 

1 . A polishing pad for a chemical mechanical polishing 
apparatus, comprising: 

a polishing surface; 

an aperture formed in the polishing surface, the 
aperture including a first section with a first di- 
mension and a second section with a second, 
different dimension; 

a substantially transparent plug having a first 
portion positioned in the first section of the ap- 
erture and a second portion positioned in the 
second section of the aperture; and 
means for securing the plug in the aperture. 

2. A polishing pad as claimed in claim 1 , wherein the 
plug is made of a polyu ret hane material. 

3. A polishing pad as claimed in claim t or claim 2, 
wherein the fixing means includes an adhesive ma- 
terial. 

4. A polishing pad as claimed in claim 3, wherein the 
adhesive material is made of an elastomeric poly- 
urethane material. 

5. A polishing pad as claimed in any of claims 1 to 4, 
wherein the first portion of the plug has substantially 
the same dimension as the first section of the aper- 
ture and the second portion of the plus has substan- 
tially the same dimension as the second section of 
the aperture. 

6. A polishing pad as claimed in any of claims 1 to 5, 
wherein the first portion of the plug includes a top 
surface which is coplanar with the polishing surface. 

7. A polishing pad as claimed in claim 6, wherein the 
thickness of the second portion of the plug is less 
than the depth of the second section of the aperture. 

8. A polishing pad as claimed in any of claims 1 to 7, 
wherein the first dimension is larger than the second 
dimension. 

9. A polishing pad as claimed in any of claims 1 to 8, 
wherein the plug includes a rim. 

10. A polishing pad as claimed in claim 9, wherein the 
fixing means includes an adhesive material located 
on the rim. 

1 1 . A polishing pad for a chemical mechanical polishing 
apparatus, comprising: 

a first layer having a polishing surface; 
a second layer adjacent to the first layer; 



an aperture through the first and second layers, 
the aperture including a first opening in the first 
layer with a first cross-sectional area and a sec- 
ond opening in the second layer with a second, 
s smaller cross-sectional area; • ~ 

a substantially transparent plug positioned in 
the first section of the aperture and a second 
portion positioned in the second section of the 
aperture; and 

10 an adhesive material fixing the plug in the ap- 

erture. 

12. A polishing pad as claimed in claim 11 , wherein the 
first layer has a first durometer measurement and 

J5 the second layer has a second, smaller durometer 
measurement. 

1 3. A method of forming a polishing pad, comprising the 
steps of: 

20 

forming an aperture in a polishing pad such that 
the aperture includes a first section with a first 
dimension and a second section with a second, 
different dimension; 

25 placing a substantially transparent plug in the 

aperture, with the plug having a first portion po- 
sitioned in the first section of the. aperture and 
a second section positioned in the second sec- 
tion of the aperture; and 

30 securing the plug in the aperture. 

14. A method as claimed in claim 13, wherein the se- 
curing step includes fixing the plug in the aperture 
with an adhesive. 

35 

1 5. A method as claimed in claim 1 3 or claim 1 4, where- 
in the step of forming the aperture includes remov- 
ing material from the polishing pad. 

40 16. A method as claimed in claim 15, wherein the pad 
is formed with first and second layers and the re- 
moving step includes removing the first section from 
the first layer of the polishing pad and removing the 
second section from the second layer of the polish- 
es jng pad. 
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